Second Order Asymptotics for Quantum Hypothesis Testing 
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In the asymptotic theory of quantum hypothesis testing, the error probability of the first kind 
jumps sharply from zero to one when the error exponent of the second kind passes by the point of 
the relative entropy of the two states, in an increasing way. This is well known as the direct part and 
strong converse of quantum Stein's lemma. Here we look into the behavior of this sudden change and 
have make it clear how the error of first kind grows according to a lower order of the error exponent 
of the second kind, and hence, we obtain the second order asymptotics for quantum hypothesis 
testing. Our method is elementary, based on basic linear algebra and probability theory. It deals 
with the achievability part and the converse part in a unified framework, with a clear geometric 
picture. 



Hypothesis testing is an important subject in mathe- 
matical statistics and information theory. A typical sce- 
nario which is of fundamental significance is the asymp- 
totic hypothesis testing problem with two hypotheses, 
each being many copies of independent and identically- 
distributed instances, occurring according to some given 
statistical description p or er, respectively. Here the sta- 
tistical descriptions p and a are probability distributions 
in classical setting and quantum states which are positive 
semi-definite matrices with trace 1 in quantum mechan- 
ics. The task is to minimize the error probabilities of 
mistaking one hypothesis for the other under certain fig- 
ure of merit. 

Classically, this problem has been well understood [H- 
Moving to the quantum case, it becomes much more 
difficult due to the non-commutativity of the quantum 
states p and cr, and the more complicated mechanics for 
observing the physical systems of interest(i.e., quantum 
measurement). Substantial achievements have already 
been made in the asymptotic theory of quantum hypoth- 
esis testing. Most notably, these include the establish- 
ing of the quantum Stein's lemma with a strong con- 
verse @, HI , the quantum ChernofT bound 0, [l(J , and the 
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quantum Hoeffding bound [11 - 
tant progress can be found in 

The Second order asymptotics is another interesting 
topic in mathematical statistics (2pj and information the- 
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In asymptotic analysis, not only does the 
second order term improve the accuracy of solution to 
the problem of concern, but also it plays a dominative 
role at points where the quantity of concern is asymp- 
totically discontinuous as a function of the first order of 
some other quantity. This is exactly the case for capac- 
ity of classical channels. When the number of channel 
uses approaching infinity, the average error probability 
can be if the transmission rate is smaller than capac- 
ity, and it becomes 1 if the transmission rate is larger 
than capacity [2?], HU . Recently, the exact relation how 
the average error probability depends on a smaller order 
of the transmission rate, in the case that the transmis- 
sion rate (of the first order) equals the capacity, has been 



derived independently in [25| and [26|, which proves to 
be quite useful in evaluating the channel coding rate for 



finite blocklcngth case [25 



There is a deep connection between hypothesis test- 
ing and channel capacity, both in the classical regime 
and in the quantum regime (3l|. Recently, this 



29, 3 



connection has been generalized to the one-shot scenario 
as well 32|, 33 1 . Indeed, such a connection is very helpful 
in the derivation of the second order coding rate in classi- 
cal channel coding [2^]. In contrast to its classical coun- 
terpart, which is an straightforward consequence of the 
central limit theorem, the second order asymptotics for 
quantum hypothesis testing is still unknown. In this pa- 
per, we solve this problem. Due to the above-mentioned 
connection, our result also make it possible to investigate 
the second order asymptotic behavior of classical infor- 
mation transmission over quantum channels j34| . 

Given a large number n of identical quantum systems, 
which are either of the state p® n (the null hypothesis) 
or of the state <7® n (the alternative hypothesis), we want 
to identify which state the systems belong to. Without 
loss of generality, this can be done by applying a two- 
outcome positive operator-valued measurement (POVM) 
{A n , 1 - A n }, with < A n < 1, on the joint Hilbert 
space H® n of the quantum systems. If we obtain the out- 
come associated to A n , then we conclude that the state 
is p® n . Similarly, the outcome associated to (1 — A n ) 
corresponds to the state er®". The error probabilities of 
the first kind and the second kind (also known as the 
type I and type II errors) are, respectively, defined as 
a n {A n ) := Tr(p® n (l - A n )) and p n (A n ) := Tr(a® n A n ). 
Thus, a n (A n ) is the probability that we incorrectly ac- 
cept cr* 8 " while it is actually p® n , and f3 n (A n ) is the prob- 
ability of the opposite situation. 

Obviously, these two kinds of errors can be made arbi- 
trarily small when n is big enough, unless p = a, because 
p®n an j a ®n are aS y m ptotically distinguishable. In an 
asymmetric setting, we want to maximize the exponen- 
tial rate at which the type II error goes to 0, under the 
condition that the type I error simply converges to 0. 
Quantum Stein's lemma tells us that this maximal expo- 
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nent is the quantum relative entropy, which is given by 
D(p\\a) = Tr(p(logp — logo - )) if supp(p) C supp(cr) and 
D(p\\&) = oo otherwise. It also says that if the type II er- 
ror goes to with an exponent bigger than D(p\\a), then 
the type I error must converge to 1. Formally, we have 
the statement as follows, (direct part Q): For arbitrary 
R < D(p\\a), there exists a sequence of measurements 
{A n ,t - A n } n , such that liminf =± log f3 n (A n ) > R 

and lim a n (A„) = 0; (strong converse [8[): if a se- 

n— > oo 

quence of measurements {A ni 1 — A n } n is such that 
liminf =±\ogP n (A n ) > D(p\\a), then lim a n (A n ) = 1. 

n— foo n— >oo 

So, from the quantum Stein's lemma, we see that the 
error probability of the first kind jumps sharply from 
zero to one when the error exponent of the second kind 
passes by the relative entropy D(p\\a) from smaller to 
larger. One would ask how this sudden change happens 
at the point that the error exponent of the second kind 
is exactly D(p\\ a). Intuitively, it should depend on a 
smaller order of the error exponent. In this paper, we 
have determined what this smaller order is, and how the 
error probability of the first kind depends on it. After 
defining the error-dependency function, we present our 
result in Theorem [2] as follows. 

Definition 1 We define the function a n (Ei, E 2 \f), 
which reflects the dependency of the optimal error proba- 
bility of the first kind on the error exponent of the second 
kind, up to the order n and y/n, as 

a n (E 1 ,E 2 \f) := 

\a n (A n ) (3 n {A n ) < exp(- {E in + E 2 ^/n~ + /(«)))} , 

(1) 



mm 



where f(n) is a function of some order other than n and 
y/n, which is to be specified when necessary. 

Theorem 2 Let $(a;) be the cumulative distribution 
function of the standard normal distribution, i.e., 
:= -A= f x e~* / 2 di, we have 

V / V27T J "OO 

lim a n {E 1 ,E 2 \f) 

n— >oc 

'0 ifE x <D(p\\o-),f£o{n) 
ifE x =D{p\\o-),f&o{^n) 



^/V{ P \\a) / 

1 if Ex >D(p\\o-),f€o(n), 



(2) 



where V(p\\o~), which we name the quantum relative vari- 
ance of p and a, is defined as 

V(p\\a) := Trp(logp- loga) 2 - D 2 (p\\a). 

Remark 1: The first and third cases of Eq. ([2]) in The- 
orem [2] are nothing else but the direct part and strong 
converse of quantum Stein's lemma, respectively. In fact, 
these two cases can be derived from the second case (see 



the "Proof of Theorem [2]' section). We include them in 
our theorem such that one easily gets the full information 
at first sight. Besides, our result of the first case is a bit 
stronger than the usual statement of the direct part of 
quantum Stein's lemma, because the former is equivalent 
to saying "for arbitrary R < D(p\\a) and arbitrary {/?„}„ 
satisfying lim — log f3 n = R, there exists a sequence of 

n— >oo n 

measurements {A„,l — A n } n , such that /3 n (A n ) < /3 n 
and lim a n (A n )=0". 

n— ^oo 

Remark 2: If supp(p) ^ supp(er), we have D{p\\a) = 
+00. Asymptotically, the optimal error probability of the 
first kind is always 0, while the error exponent of the sec- 
ond kind can be arbitrarily large. In such a case, the sec- 
ond order asymptotics and the strong converse make no 
sense. So, in this paper, we suppose supp(/o) C supp(cr), 
and without loss of generality, we further suppose a is of 
full rank. 

Proof of Theorem [H At first, we argue that it suffices 
to prove the second case, namely, for any /(n) G o(^/n), 



lim a n {D(p\\o-),E 2 \f) = $ 



( E 2 

\Vnph), 



(3) 



From Definition [I] it is easy to sec that, for arbitrary 
Ex < D(p\\a), E 2 eR,E' 2 e K, f{n) e o(n) and f'(n) € 
o(y/n), there exists G N, such that 



a n (Ex,E 2 \f)<a n (D(p\\a),E! 2 \f 



(4) 



holds for all n > N . Now, in Eq. letting n — > 00 at 
both sides and then letting E' 2 — > — 00 at the right side, 
we get from Eq. J3|) 

0< lim a n {Ex,E 2 \f) 

n—too 

< lim Hm a n {D{p\\a),E' 2 \f) 

EL— > — 00 n— >oo 



lim $ 

E~— y — 00 



fE^ 



= 0, 



which is exactly the first case of Eq. (|2|). In a similar 
way, we can prove that the third case of Eq. <j2j) is also 
contained in the second case. Then, we show that Eq. ((3|) 
is a direct consequence of Theorem [3] presented below. 
By the definition of a n (Ex, E 2 \f), it is not difficult to see 
that, the achievability part of Theorem [3] implies 



limsupa„(£>(p||<7),£ 2 |/) < * 



Eo 



(5) 



for any f(n) G o{^/n) 1 and the optimality part of Theo- 
rem |3] implies 



liminf a n (D(p\\a),E 2 \f) > $ 



E 2 



\y/V(pW) 



(6) 
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for any f(n) G o(y/n). Since Eq. (|5|) and Eq. (|6|) together, 
in turn, led to Eq. (|3]), all we need to do is proving The- 
orem |3] □ 

Theorem 3 For quantum hypothesis testing with the 
null hypothesis p® n and the alternative hypothesis a® n , 
and the error probabilities of the first and second kinds 
denoted as a n (A n ) and f3 n (A n ), respectively, we have 
(Achievability): For any E 2 G R and f(n) G o(yfn), there 
exists a sequence of measurements {A n , 1 — A n } n , such 
that 

p n (A n )<vq>{-(nD(p\\o-) + E 2 ^h-+f(n))},(7) 
( E 2 



limsupa n ( J 4 n ) < $ 



\VnpH, 



(8) 



(Optimality): If there is a sequence of measurements 
{A n , 1 — A n } n such that 

(3 n (A n )<exp{-(nD(p\\a)+E 2 ^n~+f{n))} (9) 

holds for given E 2 G R and f(n) G o(y/n), then 



liminf a n (A n ) > $ 



En 



(10) 



Before proving Theorem [3l we do some necessary 
preparation. Write p = ~^2 x X(x)\a x ){a x \ and a = 
J2 y Kv)\by){b v \, wnere {\a x )} x and {\b y )} y , each being an 
orthonormal basis of the Hilbert space H, are the eigen- 
vectors of p and <7, respectively. X(x) and p(y) are the 
corresponding eigenvalues, which satisfy < A(x) < 1, 
< p(y) < 1 and J2xH x ) = Y, y Kv) = 1- Note that 
for the reason stated in remark 2, we assume p(y) ^= 0. 
Let x n denote the sequence x\x 2 ■ ■ ■ x n and y" denote 
2/12/2 • • • Vn- For n copies of the states p and a, we can 
write 



(11) 



with a ,i (.t' 1 ) = nr=i A (^) and k 

\a Xn ), and similarly, 



(8 \a X2 



y" 



(12) 



with p n (y") = UtiKVi) and |6™„) = \b Vl ) ® |6„ 2 ) • ■ • 
|6 y „), where the subscripts of a; and y label which systems 
they belong to. The eigenvectors of p can be written 
as superpositions of the eigenvectors of er, namely, the 
basis {\by)} y . That is, we write \a x ) = ^2 y ^xylby), where 

Ixy = (b y \a x ) G C and Y, x \l*y\ 2 = E y \l*y\ 2 = 1. In 
such a way, we have 



Define a random variable X , with alphabet {xj^Jj and 
probability distribution Px{x) = X(x). Let Y be another 
random variable with alphabet {y} y Jx, which depends 
on X with conditional distribution Py\x{v\ x ) = l7zy| 2 - 
So, the joint distribution of (X,Y) is Px.y(x,y) = 
A(x)|7a;y| 2 . Operationally, this is the probability of 
obtaining (x,y), when we measure the quantum state 
p, sequentially in the bases {|a x )}s and {\b y )} y . Let 
(X n ,Y n ) := (_Yi,Yi)(* aj y 2 ) • • ■ (X n ,y n ) be a sequence 
of independent and identically distributed random vari- 
able pairs, and each (JQ, Y) has the same distribution as 
(X,Y). Then 

n 

Px^m(x n ,y n ) =l[X(x i )\ lxiVi \ 2 = A"(z™M„,„ | 2 . 

i=l 

(14) 

As functions of X and Y, A(X) and //(Y) are also random 
variables, and so are X n {X n ) and p n (Y n ). The following 
lemma expresses the quantum relative entropy and quan- 
tum relative variance as statistical quantities of classical 
random variables. 



Lemma 4 We have 



D(p\\a) = E(x,y) log 



MX) 



X(X) 

V(p\\a) = Var ( x,y) log ^py. 



(15) 
(16) 



./Vote again that we are only interested in the case that a 
has full rank, so p(Y) > (see remark 2). During the 
computation of the right sides of Eq. $15\) and Eq. H6\) . 
if X(x) = 0, we let A(x) log A(x) := limzlogz = 0, and 

X(x) log 2 X(x) := lim zlog 2 z = 0. 

z— >o 

Proof . This is done by direct calculation. For functions 
v and w, it is easy to check that 

Trv(p) =5>(A(s)) - ^ «(A(x))| 7;cy | 2 , 



and 



Tr v(/j)i(;(ct) 
= Tr y^t'(A(.T))|a x )(q z 




«>(Mj/))l&2/>(M 



x y 



v{X{x))w{p,(y))\~f, 



■•■y 



Using these two equations with proper v and w at every 
step when needed, we get 



Tr p(log p - log er) 

(X{x) log X(x)\^/ xy \' 2 - X(x) logp(y)\^ xy 



■^2yA(X) lOgA{X)\^ xy \- - A(X)LOgp[y)\^ xyl 
xy (17) 

^Px,y(x,2/)log A(a;) 
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and 



Trp(logp - log cr) 2 

Tr p log 2 p - 2 Tr(p log /?) log cr + Tr p log 2 cr 

: E ( X (x) log 2 A(x) \ lxy | 2 - 2A(x) log \{x) 
*v (18) 
x log//(2/)|7^| 2 + A(a>) log 2 M(y)l7^ y | 2 ) 



^Px,r(ar,y) log 



= E log 



Eq. ([TTJ confirms Eq. ([15]) . and Eq. $32) and Eq. ([18]) 

together let to Eq. ([TB"]) , thus we finish the proof of 
Lemma |4j □ 
Remark 3: In Q and [l3j], similar classical expressions 
were found for quantum relative entropy and the quan- 
tum Chcrnoff distance, and that for the latter was used 
as a very powerful tool in proving the optimality part of 
quantum Chcrnoff bound. 

Proof of Theorem^' achiev ability. For any fixed E 2 G K 
and f(n) £ o(yfn), let 

L n := exp {nD(p\\a) + E 2 \fn + f{n)} . 

Associated with every x n , we define a projector Q™„ as 

y":A"(an")/p"(y")>L„ 

Write |££»):= <3£n|a£„). Referring to Eq. ([13]), we have 
l£»> = E 7.VI^>- (19) 

S":A"(i")/(i"(s")>L„ 

Let j4„ be the projector onto the space S n that is spanned 
by We claim that the sequence of measurements 

{A n , 1 — A n } n is what we needed: it satisfies Eq. ([7]) and 
Eq. ©. 

Arrange all the values of x n , in such a way that the 
eigenvalues of p® n , A"(a; n )'s, are in an increasing order. 
This gives an ordering to the vectors as well. 

Let g : — * l^™} be the bijection mapping the 
position of x n to x n itself, i.e., x n is at the <7 _1 (a: n )th 
position in the above ordering. Then we have 



A"(s(l)) < A"(ff(2)) < • ■ ■ < \ n (g(\H\ n ))- 



(20) 



Applying a modified Gram-Schmildt othonormalization 
process to the sequence of vectors 

IC(2))' IC(3))'" ' 
we obtain a new sequence of vectors 

IQi))' IQ2))' IQ3))) • ■ ( 21 ) 

The modification is that if e Span ({l^)}}"!), 

(this including the case that \££uy) — 0), we let |Cg(j)) — 



0. As a result, the set of vectors {|£"n)}:r™ consists of an 
othonormal basis of the space S n , plus some zero vectors. 
Thus 



4. = £|&X& 



(22) 



The vectors have another property as follows. 

From the Gram-Schmildt process, we know that 



(23) 



for all 1 < i < \H\ n , with the coefficients s™ e C. Fur- 
ther, from Eqs. (fl9l [20l [23]) . and paying attention to the 
definition of g, we conclude that 



IG>) = E 

y™:A"(>")/^"(j,")>L„ 



t n \b n 



(24) 



where t" nyn G C and 



E Kn y n\ 2 = l. (25) 



•y™:A"(ir")/^"(j/")>L„ 

Eqs. (TTJIMIIIS]) led to 

TrK"|C„)(C„|) 



E i^-ivon 



< 



f:A"(l")/(i"(j")H„ 

A n (x") 

in 



(26) 



So, making use of Eq. ([22]) and Eq. ([26]). we arrive at 
(3 n (An)=Tr<j® n An 

=E Tr K"icxCni) 



< 



E 



A"(x") 

Lyi 



L,, 



= exp {-(n£>(p||<7) + £ 2 V^ + f{n)) } , 



which is Eq. (0. 

On the other hand, Eq. ([8]) is confirmed as follows. Let 



l£»> 







if !€?»> = 



Obviously, |£™>) € S n . So 

\€n)(&\<A 



(27) 



(28) 
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Then we have 
a n (A n ) = 1 - Tr (p® n A n ) 

< l-^A"(o;«)Tr((|<„X<»l)(ie»Xe»|)) 



= l-^A"(a:™)(Cn|C»; 



= 1- £ A"Or")| 7 ;vi 2 

(z",2/"):A"(:c™)//j™(y")>L„ 

= 1- £ Px",y"(^ l ,y") 

(a;",y"):A"(£c")// J "(y")>L„ 



/L« n (y™) 

where the second line is by Eq. and Eq. (|28l) . the 
third line can be see from the definition of and 
the forth line follows from Eq. (TT9")) and the fifth line is 
due to Eq. <g~§. Noting that X n {X n ) = T]-Li H x i) an d 
/i"(y n ) = n"=i MC^t)) an< ^ by taking logarithms at both 
sides, we see that ^„^ y „j < L„ is equivalent to 



g^-Dp. <E 2 + J -± ]= L. 



As a result, we arrive at 
limsup a n (A n ) 

n— ¥ oo 

< lim Pr i v/n f - V log 

n->oo \ n — ' 



KYi) 



(29) 



Since f(n) G o(y / n), we have for any e > 0, there exists 
N(e), such that 

E 2 + l^<E 2 + e 

holds for all n > N(e). This means that, the right side 
of Eq. ([2U]) is no larger than 



with some fixed f'(n) € o(y / n) and lim f'{n) = +oo. 

n— f oo 

This ensures that 

lim L n (3 n {A n ) =0. (31) 

n— foo 

Associated with every x n , we define the projector Q™ n as 

y":A"(a;")/p"(j/")>L„ 

Inserting Eq. (JTTJ) into the definition of a n (A n ), namely, 
a n (A n ) := Tr(p® n (t — A n )), and after a few calculations, 
we get 

a n (A n ) = 1 - C n - D n , (33) 
where C n and D n are 

C n := Y, Tr (q™„ V^Tl 1 V^Q^ ) (34) 

D n := Y Tr ((1 - Q£„)v^K»X<£» ' 

VX(1-Q^))(35) 

The basic difficulty in bounding C„ and D„ is that the 
POVM element A n is very general except for the con- 
straint of Eq. (J9j). Nevertheless, we will be able to show 
that the D n term is asymptotically negligible, due to the 
constraint of Eq. ((9]) and our choice of L n . This in turn, 

! ensures that the C n term can be upper bounded by re- 
moving the operator "vM.„" from its expression, with 
only an infinitesimal correction (see Eq. (|44[0 . 

Now, we show that the D n term is asymptotically neg- 
ligible. Because 

a® n > (1 - Q"»)o-®"(l - Q£„) 
A"(x") 



lim Pr < y/n 



\ i=i 



X(X t ) 



D(p\\a) \<E 2 + e}, 



which, by Lemma|4]and the central limit theorem, equals 



to $ 



E 2 +e 



Hence, 



limsupa„(A„) < $ 



E 2 +e 



Because e is arbitrary, we conclude the proof of Eq. (JSj) . 



□ 



Proof of Theorem fJl' optimality. Suppose that the se- 
quence of measurements {A n , 1 — A n } n satisfies Eq. (J5]). 
We will prove Eq. (JTUJ). Let 

L n :=exv{(nD(p\\o-)+E 2 V^ + f(n))-f'{n)} (30) 



> 



Ln 



where the first inequality is owing to the commutativity 
of a® n and the projector (1 — Q™n), and the second one 
can be see from the definition of <5"« , we obtain 

f3 n (A n ) = Tr(a® n A n ) 

= Tr (>"V^(E K»X«S»l) V^) 

= ^Tr(>" (VA|<»X<»|V^ 
> ETr ^»(,«) 



^-g:„)( v /^ n |<„X<„| x /< 



This result, which is rephrased as D n < L n f3 n {A n ) 1 to- 
gether with Eq. ([3~Tj) tells us that 



lim D„ = 0. 



(36) 
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The evaluation of the C„ term will be a bit more com- 
plicated. For simplicity, we use the notation of norm, 
|| • |j, defined as 



Mil : = V M<^> = n/T 1 ' MM- 



(37) 



for \ip) being a vector of some Hilbert space. Thus, C„ is 
rewritten as 



EA"M*)feV^K" 



(38) 



Our strategy is to divide the terms in the sum of the 
above expression, into different classes, each satisfies 
some special conditions. Then we evaluate them indi- 
vidually under these conditions. For such a purpose, we 
define index sets 



Ol := {x 



A n \a™n 



> e 



) 



Ol := {^|||(l-QS„) A /i;K„)||<e 1 e 2 } 

with sufficiently small ei, £2 > 0. Denote the full set of all 
the x n, s as n , and the complementary sets of 0™ an d 
C>2 as 0™ an d O2 , respectively. Since n is the union of 
three disjoint subsets 



O' 



Ofu(o;no 2 n )u(o;n0 2 n ), 



we deal with Eq. (|38| under distinct cases that x n belongs 
to these subsets respectively, and then sum them up. 

The first case is that x n £ 0™. Noting that a projector 
(more generally, any contraction operator whose singular 
values are no more than 1) acting on a vector will only 
reduce its norm, we have 



Y^ n ^ n )\\Q n ^VK\a n xn )f 

< X^A'V l H<£A"(*") e ? 

x n eOf x " 



(39) 



-el 



The second case is that x n £ 0" n 02 ■ We upper bound 
it as 



E A "^")l 
< E A "^") 



^nKn>| 



< j2 \ n (x n )4?\\(i-Q5»)VA;\> 



(40) 



-1 c 2 

A, 

"2,2 ' 
e l e 2 



where the second line follows from inequalities 

||Q£»V^KS»)|| < ||VXK»>|| < ||K»>|| = 1 and en- 
larging the range of x n in the sum, the third line is by the 
definition of 0J? , in the fourth line, we enlarge the range 
of x n further, and for the last line, it can be easily seen 
from Eq. ([33]) and Eq. (f3"T)l . The last case, which will 
turn out to be the dominant part, is that x n £ 0" n 2 . 
In such a case, paying attention to the definition of 0™ 
and 02 , we see that 



'An |a"n 



Q2« 



1 A n \a X 7i 



|(l-Qg„)^K„ 



< 



£ie 2 



£2- 



Then, directly applying Lemma El which will be pre- 
sented and proven later, we get 



A n |a™„ 



< 



'A n \a x n 



+ 2V2e 2 . 

(41) 



Since < A n < 1, it holds that A n < v^n- As a result, 



< 



«„|v^K-) 



(a™n|A n |a"« 



^4„|a"„ 



(42) 



The last term of Eq. (g5J| and the left side of Eq. (gTJl arc 
actually the same. So, combining these two equations 
together, and noting that the right side of Eq. (j4Tj) is 
obviously upper bounded by 

\\QU<~)\\ 2 + 2^262, 

we arrive at 

||QS„Vi;K n )|| 2 < ||Q^K„)|| 2 + 2V2e 2 . 

Thus, 

,2 



E X n (x n )\\Q^VAi\^ 

< E A"(x")(||Q^K„)|| 2 + 2N/2e 2 ) (43) 

< E A "( x ")llQ""l a "")l| 2 + 2V 2 e 2. 

Now, adding Eq. ([39]). Eq. gUJ) and Eq. (J43J) together, 
we obtain from Eq. (|38l) that 



c„<EA ra (^)||Q:«K« 



A. 
,2,2 



-el + 2V2e 2 . (44) 



7 



In analogy to what we did during the derivation of Eq. J5]) 
in the proof of the achievability part of Theorem |3l the 
limit of the first term of the right side of Eq. (|4"4")) is 
computed to be 



lim £A"0r")||Q£„K„) 



lim J2 \ n (x n )h^yn\ 2 



\ n (X n ) 



lim Pr (Xnj y 



/i n (7 n ) 

hm Pr (x -,y")W«( - Vlog 



D(p\\a) 



1=1 
> E 2 



^(/(») -/'(»))} 

(45) 



/ ^2 
= 1 - $ - 



where the first equality can be see from Eq. (fl3[) and 
Eq. (1521) . the second and third equalities make use of 
Eq. (|14l) and Eq. (|30|) . respectively, the last equality fol- 
low from Lemma [4] and the central limit theorem, as 
well as the fact that /(n),/'(n) € o(y / n). Hence, due 
to Eq. (|36|) and Eq. (|45|) . and then noting that £1 , e 2 > 
can be arbitrarily small, we get from Eq. (|4"4")l that 



lim sup C n < 1 — $ 



So 



(46) 



Eventually, inserting Eq. and Eq. (|4"6")l into 

Eq. dnnj) results in 



liminf a n (A n ) > $ 




Proof . We show Eq. (|47[) as follows. 

IKl^Dl^f-HH^K)^)!! 2 

=(11(1^1)1^11 + ||H0X0W^>||) 

x (||(l^l)l^)|| -|M^kM||) 
<2(||(|^l)lv>||-||H^K)lv>||) 
<2||(|^l)lv>-W^k)lv>|| 

=2|j((^|(l - n)\<p))n\<l>) + - 

=2^/|<0|(l - 7r)|^)| 2 • ||7r|0)|| 2 + |(0|^)| 2 • ||(1 - tt)|0>| 



<2 



•1 + 1- 



<2\fs 



= 2\/2, 



where the fifth line is by the triangle inequality, the sev- 
enth line is due to the Pythagoras' theorem, and the other 
lines are trivially by direct calculations and the condi- 
tions stated in the lemma. 

□ 

Conclusion, we have obtained the second order asymp- 
totics for quantum hypothesis testing. Our result deep- 
ens the quantum Stein's lemma , in a way just like the 
central limit theorem does to the law of large numbers. 
Our method is elementary, based on basic linear algebra 
and probability theory. It deals with the achievability 
part and the converse part in a unified framework, with 
a clear geometric picture. We hope that the new result 
obtained and the new method employed, in this paper, 
will find important applications in other aspects of quan- 
tum information and, non-commutative probability and 
statistics. 

Acknowledgments. The author would like to thank 
Dr. William Matthews for introducing the results of [25[ 
and [26|, and Prof. Shunlong Luo for a conversation on 
related topics. He is especially grateful to Prof. Masahito 
Hayashi for many helpful discussions. The Centre for 
Quantum Technologies is funded by the Singapore Min- 
istry of Education and the National Research Foundation 
as part of the Research Centres of Excellence programme. 



which is Eq. ()10|) and we finish the proof of the optimality 
part of Theorem [3] □ 



Lemma 5 Let |</>) and \ip) be normalized vectors in some 
Hilbert space. Let n be a projector, if \\\(f>) — n\(f>)\\ < e, 
then 



|(l^l)k)|| 2 - \\{Am\*M\\ 2 < 2V2e. (47) 
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